Recognition Of Voice Using Mel Cepstral Coefficient & Vector Quantization

Authors

  • Priyanka Mishra
  • Suyash Agrawal
Abstract

The human voice is characteristic of an individual. The ability to recognize a speaker by his or her voice can be a valuable biometric tool with enormous commercial as well as academic potential. Commercially, it can be used to ensure secure access to a system. Academically, it can shed light on the speech processing abilities of the brain as well as on the speech mechanism. In fact, this feature is already being used, at a preliminary stage, alongside other biometrics, including face and fingerprint recognition, in commercial security products. Speaker recognition is the process of automatically identifying who is speaking on the basis of individual information contained in speech waves. Speaker recognition systems fall into two classes: speaker identification and speaker verification. Speaker identification determines which of the registered speakers a given utterance comes from, whereas speaker verification is the process of accepting or rejecting the claimed identity of a speaker. The fundamental difference between the identification and verification modes is the number of decision alternatives. In the identification mode, the number of decision alternatives is equal to the size of the population, whereas in the verification mode there are only two alternatives, accepting or rejecting the identity claim, regardless of the size of the population. Most applications of speaker recognition are actually speaker verification. Speaker recognition can also be text-based (text-dependent) or text-independent. In the text-based approach, the speaker is identified from the utterance of a fixed piece of text, while in the text-independent approach the speaker is allowed to utter any text whatsoever.
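The difference in the number of decision alternatives can be illustrated with a minimal sketch. It assumes each registered speaker is represented by a vector quantization codebook and that an utterance is scored by its average distortion against a codebook. The function names, the toy 2-D vectors, and the threshold value are hypothetical and chosen only for illustration; this is not the authors' implementation.

```python
from math import fsum

def distortion(features, codebook):
    """Average squared distance from each feature vector to its nearest codeword."""
    def sq_dist(a, b):
        return fsum((x - y) ** 2 for x, y in zip(a, b))
    return fsum(min(sq_dist(f, c) for c in codebook) for f in features) / len(features)

def identify(features, codebooks):
    """Identification: one decision alternative per registered speaker."""
    return min(codebooks, key=lambda name: distortion(features, codebooks[name]))

def verify(features, claimed, codebooks, threshold):
    """Verification: only two alternatives, accept (True) or reject (False)."""
    return distortion(features, codebooks[claimed]) <= threshold

# Toy 2-D "feature vectors" (assumed); a real system would use e.g. MFCC vectors.
codebooks = {
    "alice": [(0.0, 0.0), (1.0, 1.0)],
    "bob": [(5.0, 5.0), (6.0, 6.0)],
}
utterance = [(0.1, 0.0), (0.9, 1.0)]

print(identify(utterance, codebooks))                       # closest of all speakers
print(verify(utterance, "alice", codebooks, threshold=0.5))  # accept or reject a claim
print(verify(utterance, "bob", codebooks, threshold=0.5))
```

Note that `identify` must compare against every registered codebook, so its cost grows with the population, while `verify` only scores the claimed speaker against a threshold.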

Similar resources

Speaker Recognition using MFCC and Improved Weighted Vector Quantization Algorithm

Speaker recognition is one of the most essential tasks in signal processing; it identifies a person from the characteristics of their voice. In this paper we accomplish speaker recognition using Mel-frequency Cepstral Coefficients (MFCC) with a Weighted Vector Quantization algorithm. The feature extraction process is carried out using MFCC. It is one of the nonlinear cepstral coefficient functio...

Speaker Identification and Verification using Vector Quantization and Mel Frequency Cepstral Coefficients

In the study of speaker recognition, the Mel Frequency Cepstral Coefficient (MFCC) method is the best and most popular method for feature extraction. In recent years, the vector quantization technique has also been used to minimize the amount of data to be handled. In the present study, the Speaker Recognition using Mel Frequency Cepstral Coefficients and Vector Quantization for the letter “Zha” (in ...

Voice-based Age and Gender Recognition using Training Generative Sparse Model

Abstract: Gender recognition and age detection are important problems in telephone speech processing to investigate the identity of an individual using voice characteristics. In this paper a new gender and age recognition system is introduced based on generative incoherent models learned using sparse non-negative matrix factorization and atom correction post-processing method. Similar to genera...

Speaker Identification Based on Vector Quantization

In this paper a method of text-independent speaker recognition using discrete vector quantization is presented. The identification experiments were performed on a closed set of 599 speakers, and two different types of features were tested: cepstral mean subtraction coefficients and mel-frequency cepstral coefficients. The effect of the various codebook size on the speaker identification performanc...

A Vector Quantization Approach for Voice Recognition Using Mel Frequency Cepstral Coefficient (MFCC): A Review

This paper presents a brief survey of Automatic Voice Recognition so as to provide a technological perspective and an appreciation of the fundamental progress that has been accomplished in the area of voice communication. The voice is a signal of infinite information. After years of research and development the accuracy of automatic voice recognition remains one of the important research challenges...

Comparative Study of MFCC And LPC Algorithms for Gujrati Isolated Word Recognition

The study performs feature extraction for isolated word recognition using Mel-Frequency Cepstral Coefficients (MFCC) for the Gujarati language. It briefly explains the feature extraction methods MFCC and Linear Predictive Coding (LPC). The paper compares the performance of MFCC and LPC features under the Vector Quantization (VQ) method. The dataset comprising of males and females voices were trained and t...

Publication date: 2012